Search CORE

5 research outputs found

A novel approach for code smell detection : an empirical study

Author: Dewangan Seema
Gupta Manjari
Mishra Alok
Rao Rajwant Singh
Publication venue
Publication date: 01/01/2021
Field of study

Code smells detection helps in improving understandability and maintainability of software while reducing the chances of system failure. In this study, six machine learning algorithms have been applied to predict code smells. For this purpose, four code smell datasets (God-class, Data-class, Feature-envy, and Long-method) are considered which are generated from 74 open-source systems. To evaluate the performance of machine learning algorithms on these code smell datasets, 10-fold cross validation technique is applied that predicts the model by partitioning the original dataset into a training set to train the model and test set to evaluate it. Two feature selection techniques are applied to enhance our prediction accuracy. The Chi-squared and Wrapper-based feature selection techniques are used to improve the accuracy of total six machine learning methods by choosing the top metrics in each dataset. Results obtained by applying these two feature selection techniques are compared. To improve the accuracy of these algorithms, grid search-based parameter optimization technique is applied. In this study, 100% accuracy was obtained for the Long-method dataset by using the Logistic Regression algorithm with all features while the worst performance 95.20 % was obtained by Naive Bayes algorithm for the Long-method dataset using the chi-square feature selection technique.publishedVersio

Brage HiM

A study of dealing class imbalance problem with machine learning methods for code smell severity detection using PCA-based feature selection technique

Author: Alok Mishra
Manjari Gupta
Rajwant Singh Rao
Seema Dewangan
Publication venue: Nature Portfolio
Publication date: 01/09/2023
Field of study

Abstract Detecting code smells may be highly helpful for reducing maintenance costs and raising source code quality. Code smells facilitate developers or researchers to understand several types of design flaws. Code smells with high severity can cause significant problems for the software and may cause challenges for the system's maintainability. It is quite essential to assess the severity of the code smells detected in software, as it prioritizes refactoring efforts. The class imbalance problem also further enhances the difficulties in code smell severity detection. In this study, four code smell severity datasets (Data class, God class, Feature envy, and Long method) are selected to detect code smell severity. In this work, an effort is made to address the issue of class imbalance, for which, the Synthetic Minority Oversampling Technique (SMOTE) class balancing technique is applied. Each dataset's relevant features are chosen using a feature selection technique based on principal component analysis. The severity of code smells is determined using five machine learning techniques: K-nearest neighbor, Random forest, Decision tree, Multi-layer Perceptron, and Logistic Regression. This study obtained the 0.99 severity accuracy score with the Random forest and Decision tree approach with the Long method code smell. The model's performance is compared based on its accuracy and three other performance measurements (Precision, Recall, and F-measure) to estimate severity classification models. The impact of performance is also compared and presented with and without applying SMOTE. The results obtained in the study are promising and can be beneficial for paving the way for further studies in this area

Directory of Open Access Journals

Code Smell Detection Using Ensemble Machine Learning Algorithms

Author: Alok Mishra
Manjari Gupta
Rajwant Singh Rao
Seema Dewangan
Publication venue: 'MDPI AG'
Publication date: 13/10/2022
Field of study

Code smells are the result of not following software engineering principles during software development, especially in the design and coding phase. It leads to low maintainability. To evaluate the quality of software and its maintainability, code smell detection can be helpful. Many machine learning algorithms are being used to detect code smells. In this study, we applied five ensemble machine learning and two deep learning algorithms to detect code smells. Four code smell datasets were analyzed: the Data class, the God class, the Feature-envy, and the Long-method datasets. In previous works, machine learning and stacking ensemble learning algorithms were applied to this dataset and the results found were acceptable, but there is scope of improvement. A class balancing technique (SMOTE) was applied to handle the class imbalance problem in the datasets. The Chi-square feature extraction technique was applied to select the more relevant features in each dataset. All five algorithms obtained the highest accuracy—100% for the Long-method dataset with the different selected sets of metrics, and the poorest accuracy, 91.45%, was achieved by the Max voting method for the Feature-envy dataset for the selected twelve sets of metrics

Multidisciplinary Digital Publishing Institute

Code Smell Detection Using Ensemble Machine Learning Algorithms

Author: Alok Mishra
Manjari Gupta
Rajwant Singh Rao
Seema Dewangan
Publication venue: MDPI AG
Publication date: 01/10/2022
Field of study

Directory of Open Access Journals

Limitations, progress and prospects of application of biotechnological tools in improvement of bamboo—a plant with extraordinary qualities

Author: A Ojha
A Sood
A. K. Dhawan
AAE Hassan
AB Zamora
AB Zamora
AK Mukherjee
AL Nadgir
B Tian
C Heylen
C Xuhe
CK John
CS Lin
CS Lin
CS Lin
CS Lin
CS Lin
CS Lin
CS Lin
CW Ho
D Ajithkumar
D Negi
DH Janzen
DQ Tang
E Friar
E Friar
EEE Diab
EJ Judziewicz
F Jullien
F Shirin
FD Hempel
G Bernier
GA Huttley
GR Rout
HC Chaturvedi
HK Nadha
HP Heng
HS Tsay
HY Zhang
ID Arya
ID Arya
IVR Rao
IVR Rao
J Gielis
JAT Teixeira da Silva
JG Torrey
JK Yu
JP Loh
JP Nitsch
K Gillis
K Hirimburegama
K Wang
KD Mudoi
KT Cheah
L Huang
LC Huang
LC Huang
M Banerjee
M Das
M Das
M Das
M Joshi
M Kobayashi
M Singh
M Singh
M Thiruvengadam
M Tran Thanh Van
M Watanable
MB Zhou
ML Marulanda
ML Marulanda
ML Yeh
ML Yeh
ML Yeh
MP Alexander
MP Dayan
MS Kumar
MV Shirgurkar
N Bag
N Bag
N Bystriakova
NA Barkley
OL Gamborg
P Bisht
P Das
P Das
P Kapoor
PD Keukeleire
Prutpongse
R Mehta
R Ravikumar
R Satsangi
R Vongvijitra
R Wiersma
R Yashoda
R Yashoda
Rajwant K. Kalia
RG Butenko
RK Agnihotri
RK Kalia
RK Kalia
RK Sharma
RK Verma
RL Banik
Rohtas Singh
RS Nadgauda
RS Nadgauda
RS Nadgauda
RU Schenk
S Arya
S Arya
S Arya
S Arya
S Arya
S Arya
S Ghosh
S Godbole
S Hu
S Kalia
S Maity
S Nayak
S Nayak
S Ogita
S Ogita
S Ogita
S Saxena
S Saxena
S Saxena
S Saxena
SA Ansari
SA Kelchner
SAM Nurul Islam
Sanjay Kalia
SH Woods
Sharbati R. Singh
SL Hu
SM Arshad
SM Chambers
SMSD Ramanayake
SMSD Ramanayake
SMSD Ramanayake
SMSD Ramanayake
SMSD Ramanayake
SR Singh
SR Singh
Sunita Dalal
SY Chen
T Murashige
T Werner
TK Scott
TR Hodkinson
TS Filgueiras
U Rao
V Sharma
VM Jimenez
WB Chiu
WC Chang
WC Chang
WJ Dong
X Lin
XC Lin
XP Li
Y Isagi
Y Mishra
Y Mishra
Y Mishra
Y Sun
Y Suyama
YG Jie
YH Hsu
YH Komatsu
YJ Zhang
Z Peng
Z Qiang
Z Zhi-jun
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref